The Stochastic Approach for Link - StructureAnalysis ( SALSA ) and the TKC E ectR

نویسندگان

  • R. Lempel
  • S. Moran
چکیده

Today, when searching for information on the WWW, one usually performs a query through a term-based search engine. These engines return, as the query's result, a list of Web sites whose contents matches the query. For broad topic queries, such searches often result in a huge set of retrieved documents, many of which are irrelevant to the user. However, much information is contained in the link-structure of the WWW. Information such as which pages are linked to others can be used to augment search algorithms. In this context, Jon Kleinberg introduced the notion of two distinct types of Web-sites: hubs and authorities. Kleinberg argued that hubs and authorities exhibit a mutually reinforcing relationship: A good hub will point to many authorities, and a good authority will be pointed at by many hubs. In light of this, he devised an algorithm aimed at nding authoritative sites. We present SALSA-a new stochastic approach for link structure analysis , which examines random walks on graphs derived from the link structure. We show that both SALSA and Kleinberg's Mutual Reinforcement approach employ the same meta-algorithm. We then prove that SALSA is equivalent to a weighted in-degree analysis of the link-structure of WWW subgraphs, making it computationally more eecient than the Mutual Reinforcement approach. We compare the results of applying SALSA to the results derived through Kleinberg's approach. These comparisons reveal a topological phenomenon called the TKC EEect which, in certain cases, prevents the Mutual Reinforcement approach from identifying meaningful authorities.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Stochastic Approach for Link - StructureAnalysis ( SALSA ) and the TKC

Today, when searching for information on the WWW, one usually performs a query through a term-based search engine. These engines return, as the query's result, a list of Web sites whose contents matches the query. For broad topic queries, such searches often result in a huge set of retrieved documents, many of which are irrelevant to the user. However, much information is contained in the link-...

متن کامل

The Stochastic Approach for Link - StructureAnalysis ( SALSA )

Today, when searching for information on the WWW, one usually performs a query through a term-based search engine. These engines return, as the query's result, a list of Web pages whose contents matches the query. For broad topic queries, such searches often result in a huge set of retrieved documents, many of which are irrelevant to the user. However, much information is contained in the link-...

متن کامل

The stochastic approach for link-structure analysis (SALSA) and the TKC effect

Today, when searching for information on the World Wide Web, one usually performs a query through a term-based search engine. These engines return, as the query’s result, a list of Web sites whose contents match the query. For broad topic queries, such searches often result in a huge set of retrieved documents, many of which are irrelevant to the user. However, much information is contained in ...

متن کامل

Downweighting Tightly Knit Communities in World Wide Web

We propose two new algorithms for using World WideWeb link structures to determine authority values of web pages from search queries. Both algorithms postulate an underlying latent cluster structure, in an effort to avoid the Tightly Knit Community (TKC) effect which can occur in the Kleinberg and SALSA algorithms. The first algorithm, Similarity Downweighting (SD), weights outlinks inversely w...

متن کامل

Downweighting Tightly Knit Communities in World Wide Web Rankings

We propose two new algorithms for using World WideWeb link structures to determine authority values of web pages from search queries. Both algorithms postulate an underlying latent cluster structure, in an effort to avoid the Tightly Knit Community (TKC) effect which can occur in the Kleinberg and SALSA algorithms. The first algorithm, Similarity Downweighting (SD), weights outlinks inversely w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011